This page contains my current progress using persistent homology analysis based on the dataset from the study Dispersal enhances beta diversity in nectar microbes by Rachel Vanette and Tadashi Fukami.
The original study by Vanette and Fukami investigated the effect of dispersal on the diversity of the microbial communities of the Mimulus aurantiacus flower found in Jasper Ridge Biological preserve. In particular, they wanted to see how varying levels of dispersal affects beta diversity, the “the variation in species composition among local communities.” Increasing opportunities for dispersal between microbial communities within a region are thought to have a homogenizing effect on the communities as the members of individual communities are free to move between the communities. Thus, over time, the communities within the region will all be more and more identical, thus decreasing beta diversity.
To study this, Vanette and Fukami, manipulated dispersal opportunities to the floral nectar of Mimulus aurantiacus. The floral nectar of M. aurantiacus is initially sterile (owing to the high sugar content), and the main dispersal vector of microbes into this nectar is via Calypte anna, or Anna’s hummingbird. These conditions make it an ideal system to study the effects of dispersal on beta diversity.
The flowers were separated into three treatment groups. These treatments, in order of decreasing dispersal opportunity, are: exposed flowers, with no barriers to dispersal; caged flowers, which prevented hummingbirds from accessing the flowers; and caged flowers, which prevented hummingbirds and insects from accessing the plants, leaving wind as the only dispersal vector to the affected flowers. These flowers were located in various places across Jasper Ridge Biological Preserve. The figure below has some sample locations.
(To add: data collection, bioinformatics methods)
They calculated the beta diversity of the microbial communities in each treatment by using the multivariate method based on dissimilarity matrices as defined by Legendre and Caceres. The authors found that the beta diversity of the floral nectar microbial communities in the exposed treatment group was higher than in the treatment groups with lower opportunity for dispersal. While the authors outline several reasons why increased dispersal may increase beta diversity, 
(To add: review of persistent homology)
I’m interested in applying persistent homology to microbial communities. A significant challenge for persistent homology is to relate the analysis back to the system of study. For example, in protein structure analysis (and other systems where spatial structure is important), a loop in the dataset might literally be a loop in the protein structure of study. But what do these topological structures represent in a microbial community?
This system in particular is interesting because there is a clear signal to detect - the exposed treatment has more beta diversity than the other two treatments, which are very similar in terms of beta diversity. Can persistent homology detect this same signal? And can we use this as a starting point to help us interpret what these bars mean in the context of microbial communities?
Barcode diagram for all treatments together:

The persistence diagrams for the bagged and caged treatments:

The two diagrams above are very similar. In the first dimension, the barcodes have a similar shape, and there is a little smattering of second dimensional features. The similarity in the barcode diagrams might reflect the similarity in beta diversity between the two treatments. As shown in the figure above, the beta diversity found in the caged and bagged treatments were very similar to each other.
Of note is that the bagged samples have noticably more 1st-dimension barcodes than the caged samples. While there could be a meaningful biological reason for this, I believe the more numerous barcodes are just due to the fact that there are more bagged samples (104) than caged samples (73).

What’s interesting here i


Paul Villanueva
Ph.D. Student - Bioinformatics and Computational Biology
Iowa State University, Ames, IA.